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POLYPgPTIDE HftVTNG IMMnNQIflC TCAL AOTTVTTY FOP HfiTi ; 
DIAGWOST TC REAGENT AND/OR VACCTWTi! 

TECHNICAL FIELD 

This invention relates to a polypeptide havirtg 
5 iamunological activity for use as a diagnostic reagent 
and/or a vaccine component. 

BACKGRQTTNn APT 

Diagnostic kits for use in screening individuals for 
infection with human immunodeficiency virus (HIV) infection 
10 frequently include reagents comprising HIV antigens which 
are used to detect antibodies using known immunological 
techniques including ELISA, Western Blot, latex 
agglutination and immuno-luminescent and immuno-fluorescent 
techniques . 

15 The effectiveness of such techniques however depends 
upon selection of suitable immunological reagents and one 
particular difficulty which arises is that particular 
reagents are often specific to individual strains or groups 
of strains of HIV. Thus, for example, known diagnostic 

20 reagents based upon HIV-1 may fail to detect antibodies 
resulting from an infection of a patient with HlV-2. 

Similarly, in the production of vaccines designed to 
protect individuals against HIV infection, the use of 
antigens derived from one particular strain of h±V may fail 

25 to provide adequate protection against infection with other 
strains. 

It is an object of the present invention to overcome 
such problems. 

DISCIOSURE OF T KVENTTQN 

30 It has now been found that the product of expressing a 
substantial part of the HIV-pol gene iii a suitable host has 
antigenic properties which allows the above-mentioned 
problems to be overcome. 

Thus according to one aspect of the present invention 

35 there is provided the use as an antigenic reagent in the 
diagnostic test or as a vaccine component of a polypeptide 
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comprising a substan'kial portion of each of more tlian one of 
the constituent: proteins coded for by the HIV-pol gene. 

Diagnostic kits emd vaccines comprising said polypeptide 
form further aspects of the present invention. 
5 The HIV-pol gene codes for four enzymes, namely a 

protease, a reverse tramscriptase, a ribonuclease referred 
to as PNAse H and an enzyme referred to as Integrase. 

It is believed that during infection of a T cell by HIV 
a full length precursor is expressed which is then cut up 
10 into the discrete proteins listed above. These have the 
following activities and (it is thought) act in the order 
indicated:- 

Protease Precursor Cleavage 

Reverse Trsmscriptase Preparation of viral DNA from viral 
15 RNA 

RNAse H Destruction of viral RNA leaving 

newly synthesised DNA 

Integrase Insertion of said DNA into host cell 

genome 

20 According to a preferred aspect of the present 

invention, said constituent proteins are enzymes coded for 
by the HIV-pol gene and the polypeptide thus comprises a 
substantial portion of each of a plurality of enzymes 
selected from HIV-pol protease, HIV-pol reverse transcrip- 

25 tase, HIV-pol RNAse H and HIV-pol Integrase. Most prefer- 
ably, the polypeptide comprises substantial portions of all 
four of said enzymes. 

In vivo , the initial product of expressing the HIV-pol 
gene is cleaved into its individual elements by the 

30 protease. The active site for proteolytic activity occurs 
adjacent the NH2-terminus of the expression product, 
corresponding to the 5^ -end of the protease gene. 



3 

According to a preferred aspect of the present 
invention, the polypeptide omits at least that part of the 
amino acid sequence of the HIV-pol protease gene which codes 
for the active site responsible for proteolytic activity. By 
5 omitting this portion, the integrity of the polypeptide is 
maintained and it is less liable to degrade. 

BRIEF DE SCRIPTTOW OF DPAWTWgg 

Figure l is a schematic diagram showing the procedure of 
Example 1; 

10 Figure 2 shows the results of electrophoresis tests 
carried out in the manner explained in Example 2; and 

Figure 3 is a graph showing the results of the 
experiments carried but in Example 3. 
BEST MODE FOR CAPPYT NG OUT 'Wng TNVEWTTOW 

15 The HIV-pol gene of several strains of HlV-i has been 
cloned and the corresponding amino acid sequences derived 
from the determined DNA sequences. The amino acid sequences 
of ten strains appear in the accoii5)anying Table l at the end 
of this disclosure, m Table 1, the full sequence of strain 

20 HIV HXB2 is given, whereas for the other nine strains, only 
sequence differences are listed. As used herein, the term 
"constituent protein coded for by the HIV-pol gene" refers 
to a protein having sufficient amino acid homology with the 
sequence of HIV HXB2 appearing in the accompanying Table so 

25 as to result in antibodies raised against the protein cross- 
reacting with a polypeptide consisting of the precise amino 
acid sequence of HIV HXB2. 

The HIV-pol gene can be expressed to produce the desired 
polypeptide by various techniques, e.g. some or all of the 

30 baculovirus techniques described in U.S. Patent 4,745,051 to 
Gale E. Smith et al issued on May 17, 1988; Baculovirus 
Vectors for Expression of Foreign Genes by c. Yong Kang, 
Advances in Virus Research, Vol. 35, pp 177-192, Academic 
Press Inc., 1988; A Manual of Methods for Baculovirus 

35 Vectors and Insect Cell Culture Procedures, Max D. Summers 
and Gale E. Smith, May 1987, Texas A&M University; and 
Baculoviruses as Gene Expression Vectors, Lois K. Hiller, 
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Ann. Rev. Microbiol. 42, pp 177-1991; the disclosxires of 
which are incorporated herein by reference- However our 
Canadian Patent Application Serial No. 591,908 filed on 23rd 
February 1989 (and equivalent British Patent Application 
5 Serial No. 89 04426.7 filed on February 27, 1989 and US 
Patent implication Serial No. 316,768 filed on February 28, 
1989) describes coid claims an improved baculovirus 
expression system capeible of producing foreign gene proteins 
at high levels and the use of this expression system is 

10 particularly preferred for expressing the polypeptide of the 
present invention. 

The process disclosed in our Canadiem patent employs a 
recombineoit baculovirus containing at least a major part of 
a polyhedrin gene promoter region, a transcription 

15 termination sequence of a polyhedrin structural gene, a 
foreign structural gene (e.g. an HIV-pol gene) having a 
trcinslation start codon followed by coding sequences and a 
translation stop codon. The foreign gene is located between 
the promoter region and the termination sequence. 

20 Immediately upstream of the start codon there is a putative 
insect cell ribosome binding site for the polyhedrin gene 
effective for overcoming resistance of susceptible insect 
cells to express the foreign gene at a high level. The 
putative ribosome binding site comprises at least the final 

25 four nucleotides of the sequence 5'-ACCTATAAAT-3 ' . 

Example 3 of the Canadism application describes the 
production of the pol protein of HIV-1 in a baculovirus 
expression system based on Autoorapha califomica 
nucleopolyhedrosis virus (ACNPV) and specifies that a 

30 recombinant baculovirus designated ACNPV-HIV-YK-pol has been 
deposited at the American Type Culture Collection pf 12301 
Parklawn Drive, Rockville MD 20852, USA under Accession No. 
ATCC VR 2233. Deposit was made on November 30, 1988. The 
disclosure of our Canadian Patent Application Serial No. 

35 591,908 is incorporated herein by reference. 

Utilising the procedures described in Example 3 of 
Canadian Patent Application Serial No. 591, 908, a polypeptide 
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comprising the protease, RNAse H and Iritegrase enzymes of 
HIV strain HIV-XB2 may be produced. 

The polypeptide can be iised as a diagnostic reagent or 
vaccine component in ways known to persons slcilled in the 
5 art, e.g. by the techniques indicated in the publication 
entitled Clinica, Testing for HIV and AIDS, The Next 
Five Years, George Street Publications Ltd., Richmond, 
Surrey, UK, the disclosure of which is incorporated herein 
by reference. 

10 The invention is illustrated in more detail by the 

following Examples. Example 1 illustrates the production of 
a modified recombinant plasmid pUC18-Dpol3 having a 2? 3 bp 
deletion at the 5 '-terminus emd its expression as 
polypeptide lacking the firat 91 amino acids at the 

15 NHj-terminxxs of the HIV-pol protease. Examples 2 and 3 relate 
to the expression of the polypeptide and its use as a 
diagnostic reagent. 

EXAMPLE 1 

Constmiction of bacu lovirus transfer vector containing HXV-l 

20 pol gene with 273 bo deletion at 5* terminug 

As illustrated in Figure 1, the Bglll and Sail fragment 
of plasmid pHXB-2D containing the HIV-l pol coding region 
was isolated eind inserted into BamHI emd Sail sites of 
pUC18. The resulting recombinant plasmid (pUC18-Dpol 1) was 

25 cut with Sstl and dephosphorylated. A synthetic double- 
stranded crossover linker containing a Sstl cohesive end, a 
BamHI site^ the putative insect Spodoptera frugiperda (SF9) 
cell ribosome binding site (P) and 15 nucleotides of the 
homology searching sequences which overlaps with the 5' 

30 terminus of the pol gene was ligated at the Sstl site and 
trcinsformed. The recombinant plasmid, (pUC18-Dpol 2) was 
isolated, digested with sPHl, dephosphorylated and ligated 
with another crossover linker DNA containing SphI cohesive 
end at the 3' terminus, BamHI site and 15 nucleotides of the 

35 homology searching sequences which recognise the 3* terminus 
of the pol gene. The resulting recombinant plasmid 
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(pUC18-Dpol 3) contains the putative SF9 cell ribosome 
binding site (P) followed with pol open reading frame 
starting with the first ATG (TI) codon (map unit 2357-2359) 
in the pol gene and the translation termination (TT) codon 
5 TAG (map unit 5093-5095) . This whole cassette was flanked 
with BamHl sites. The BamHl fragment was isolated and 
inserted into the BamHl site of the pAcYMl baculovirus 
transfer vector (pAcYMl-Dpol) . The pAcYMl-Dpol transfer 
vector DNA was used to co-transfect SF9 cells with wild type 
10 AcNPV DNA to isolated recombinant AcNPV HIV-YK pol virus. 

EXAMPIiE 2 

Expression of pqI aene product s bv recombinant: baculoviruses 
Recombinant AcNFV-HIVWHpol contains an insert comprising 
essentially the whole DNA sequence of the HIV-pol gene (see 

15 Table 2 at the end of the present disclosure) . When 

expressed^ the resulting full length gene product of the 
HIV-pol gene is "processed", i.e. the proteolytic active 
site of the HTV pol protease gene cleaves the protein into 
66 JcD, 51 kD and 32 JcD fragments. 

20 By way of comparison, recombdLnant AcNPV-HIVYKpol (see 
Table 3 at the end of the present disclosure) omits 
NH2-terminal amino acid sequences containing the proteolytic 
active site of the HIV-pol protease. When expressed, the 
restating gene product is not "processed", i.e. the 95 JcD 

25 protein remains intact. 

The following experiments illustrate this. 
Uninfected S. fruaiperda (SF9) cells, or SF9 cell 
infected with recombinant baculoviruses AcNFV-HiVWHpol , 
AcNPV-HIVYKpol or with wild-type AcNPV, were harvested after 

30 72 hours of infection. Lysates of the infected or uninfected 
cells were electrophoresed in a 12% polyacrylamide Laemmli 
gel emd proteins are identified by either Coomassie blue 
staining (S) or Western blot analyses (W) using the standard 
HIH HIV positive immunoglobulin. As shown dLn Figure 2, lanes 

35 1, 2 and 3 represents the lysates of AcNPV-HIvyHpol - 

recombinant virus infected cells, lanes 4, 5 and 6 represent 
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the lysates of AcNPV-HlVYKpol recombinant virus infected 
cells, lane 7 shows the wild-type AcNPV infected cell 
lysate, lane 8 shows uninfected cell lysate and lane 9 shows 
molecular weight markers. Lane 3 and 6 show the whole cell 
5 lysate, lanes 2 and 5 show proteins in the infected cell 
nuclei and lanes l and 4 show proteins in the infected ceil 
cytoplasm. P denotes polyhedrin protein and arrows show 95K 
Dal uncleaved pol gene product representing 91 amino acid 
deletion of protease produced by AcNPV-HIVYKpol virus and 
10 66K Dal, 51 K Dal and 33K Dal processed pol gene products in 
AcNPV-HIVYHpol virus infected cells. 

EXAMPIiE 3 

A. Product ion of pol gene product 

Recombinant ACNPV-HIVY^ol virus infected SpodoptaT-a 
15 fruqiperda (SF9) cells were harvested 4 days after ' 

infection. Nuclei of infected cells containing most of the 
pol gene product were isolated by treating the infected 
cells with 0.1% Triton X-lOO and 0.5% NP40 on ice for 20 
minutes followed by centrifugation at 750 g for 10 minutes. 
20 The pelleted nuclei were denatured with 1% SDS in TRIS-HCl 
PH 8.0 at room temperature for 30 minutes. The cellular DNAs 
were removed by ethanol precipitation using 2 volumes of 
100% ethanol. The SDS in the solution were removed by add- 
ition of 25 mM KCL incubated at 4-C for 30 minutes' followed 
25 by centrifugation at 12,700 g for 15 minutes. The pol gene 
product in the supernatant was used for anti-pol ELISA. 

B. Detection of HT V antiboHigg bv Ri.T ga 

The pol antigen was diluted in PBS and dispensed in a 
microtiter plate (Nunc cat 269620). The concentration of pol 
30 to coat plates was determined empirically on the strength of 
bands on polyacrylamide gels. 

The concentration of pol necessary to coat one well was 
between 1 zaid 10 fig. 

The plate was covered and incubated at 4*C. The time of 
35 incubation varied between 12 and 24 hrs without no appar^t 
differences in reactivity. 
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The plates were then washed three times in PBS tween 20 
employing a Skatron plate washer. 

Various standards, NIH HIV+ immunoglobulin (NIH STD) , 
pool HIV+ plasma (PAT STD) and plasma from non-infected 
5 individuals (NS) were employed. The standards were diluted 
beginning at 1:200 for NIH STD, and 1:10 for PAT STD and NS. 
Unknowns were tested usually at 1:50 but dilutions as high 
as 1:10 ccin be employed. 

All samples were inactivated before testing. Normal sera 

10 were processed in the same fashion as sera from AIDS 
patients. The inactivation was performed with 
4>-aminoethyltrioxsalen- hydrochloride (AMT) from Lee 
Biomolecular Research Inc. (San Diego, California cat 231) 
and an ultra violet light trans-illuminator (Spectroline 

15 model TC-365, Fisher Scientific Ottawa Ont.), The AMT was 
reconstituted in 50% ethanol at 1 jixg/ml. The sera was 
aliquoted in Eppendorf tubes and for every 100 /zl of serum 
or plasma, 10 ^1 of AMT was added to the sample. The 
samples were layed in the transilluminator and irradiated 

20 for 5 minutes. An additional 10 ;il of AMT was added to the 
sample euid the samples were irradiated for a further 5 
minutes. The samples were inactivated by this procedure. 

The incubation time of the human-anti-pol was 30 to 40 
minutes at room temperature (23 -C) (the time of incubation 

25 found to be quite critical) . Therefore, all dilutions of 

standards (negative and positive) and unknowns was performed 
in a separate plate. Once all dilutions were done, the 
dilutions (100 /il) were transferred to the EKESA plate 
coated with pol employing a multichannel pipettor. All 

30 dilutions were with PBS Tween 20 (0.1%). 

The state of the serum or plasma sample was found to be 
important. Samples repeatedly frozen and thawed usually gave 
higher backgrounds. This was especially evident with samples 
from normal individuals. 

35 The plates were washed three times in PBS-Tween 20 after 
the 30 minute incubation with the first antibody. 
A Skatron ll plate washer was employed for this purpose. 
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The second antibody used (goat anti-human Ig linked to 
horse radish peroxidase) was an affinity purified reagent 
obtained from Tago Diagnostics (Inter Medico To DNT cat 
2393) . An appropriate dilution was determined experimen- 
5 tally (approximately 1:2,000) is made in PBS-Tween 20 

(0.1%). 100 Ml was dispensed into the wells except for one 
which will be employed as a blank for the plate reader. The 
plate was incubated for 1 hour at room temperature.^ 

The plates were washed three times with PBS-Tween 20 
10 employing the Skatron II plate washer. 

Freshly prepared substrate (100 ^1) was added to the 
wells and after 20 minutes the reaction stopped with the 
addition of 100 fil of 0.07M H2SD4. 

The plate was read at 450 nm in the BIOTEK BL/310 ELISA 
15 plate reader. A hard copy of the data was obtained from the 
reader and the data also stored directly onto computer 
diskette for further processing by the Anelisar program. 

Additionally, controls were also performed on each 
plate. In two or three wells no serum or plasma was added. 
20 In one well no primary or secondary antibodies were added 
but substrate was. This well was employed to blank the. ELISA 
plate reader. The remaining wells were employed to determine 
the extent of binding of the secondary antibody (Goat 
anti-HIg-HRPO) to POL. Thus, these wells received no primary 
25 antibody but secondary antibody and substrate with the 

appropriate washes in between each incubation. Usually the 
value of this latter control is below O.iooo OD. 

The results are shown in Figure 3. 

The following materials were use for the anti-pol ELISA 
30 procedure 
Buffers 

Phosphate B uffered Saline fPBS) 

NajHPO^ (dibasic anhydrous) 13.6 g 
NaHjPO^ (monobasic) 2.4 g 

35 NaCl 90.0 g 

Salts are dissolved in 8 litres of distilled deionized 
water and pH is adjusted to 7.2 with NaOH or HCl. This 
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buffer is employed as coating buffer, diluent and washing 
buffer. The latter two buffers are modified as indicated 
below. 

Diluent for primary and seconda ry antibodies and waghing 
5 ^u^^er 

PBS + 0.1% Tween 20 (Sigma, St. Louis MO) (0.1 ml Tween 
20 + 100 ml PBS) . The diluent buffer is made up daily. 
Substrate buffer 

Equal volumes of O.IM NajHPO^ (0.709 g/50 ml) and O.IM 
10 citric acid (0.960 g/50 ml). The pH is adjusted to 4.0 with 
NaOH or HCl. The substrate buffer is made up weekly. 
Substrate 

A tablet (2 mg) of o-phenylenediamine (Sigma cat. P6787) 
is dissolved into 10 ml of sxabstrate buffer. Hydrogen 

15 peroxide (4 ^1 of 30%) is added to the solution just prior 
to plating. The solution should be kept in the dark as much 
as possible. 
Stopping reagent 

The enzymatic reaction is stopped with 0.07M HjSO^. 

20 It is a particularly advantageous feature of the 
polypeptides, the use of which is described herein, that 
they cross-react with antibodies against diverse strains of 
HIV. Thus, for example, the polypeptides described herein 
based on HIV-1 can cross-react with antibodies raised 

25 against various strains of HIV-1 and HIV-2. Thus they may be 
used in diagnostic kits for detecting either virus category. 
Similarly, in vaccines they can provide broad-spectrum 
protection. 

Industrial A pplicability 
30 As will be apparent from the above, the present 

invention can be used in the medical field for testing for 
HIV infection and for immunizing against HIV infection, as 
well as for other diagnostic or prognostic purposes. 
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TABLE 1 



HIV-1 pol protein sequence of HIVHXB2 virus 
Data from Hxixnan Retroviruses and AIDS 1988 
Los Alanos National Laboratory 

AcNPV-HlVWHpol 

HIVH}CB 2Met PheP heArgGluAspLeuAloPheLeuGlnGlvLvsAloArQGluPh«R«r>!Cor>Ki ig 

HIVBHll2^=fe _ 20 

HIVBH5 c m 20 

HIVPV22 _ 2D 

HIVBRU Gin 20 

HivmN P 

19 

■ 19 



HIVSF2 



HIVRF Asn Pro— : Leu — ^ 

HI VWAL Asn Pro— -_^p r o i g 

HIVELI —Asn Pro—- Gly Leu ProLys 19 



HIVHXB2 . . . . : GlnThrArg AlaAsnSerProThrArg 28 



HIVBH1 02 ThrArgAIoAsnSerProThrZleSorSerGlu- 

HIVBK5 ThrArgAloAsnSerProThrlleSerSerGlu- 

HIVPV22 ThrArgAlQAsnSerProThrlleSerSerGlu- 

HZVBRU ThrArgAloAsnSerProThrXleSerSerGlu- 

HIVMN — ^ 

H1VSF2 ' 

HIVRF — — 

HIVMAL ■ ■■ 



HIVBH102 ^'^®^^"*"®"*^^"^°^^'^*^^^*'^*®P*®"**"^®*"^'^^®'"^^"^**^^VAlaAs 



HIVBH5 
HIVPV22 
HIVBRU 
KIVPIN 



HIVSF2 GlyGlu Leu 

HIVRF 



HIVBRU 
HIVMN 
HIVSF2 
HIVRF 



-Ile- 



-Ile- 



40 
4»0 
A0 
40 
0 
26 
28 



48 
60 
60 
60 



Leu— ^ _ 60 

0 



48 



Leu-~ GXu- — ■ 47 



HIVMAL Arg Gly . . . LysThrLeu ^Thr Glu 47 

HIVELI —-Arg — . . . ProLeu LysThr Glu 47 

HIVHXB2 GlnGlyThrValSerPheAsnPheProGlnVclThrLeuTrpGlnAroProLeuVolThr 6Q 

HIVBH102 — ^ Xle- 

HIVBH5 — 

HIVPV22 — ^ ■ ■ .. 



' Tr.v 80 

80 

80 

— ~- — B0 

.1. 0 

— — 68 

Ser — He — — — He—- B7 

HIVMAL !!• ser 11« ——Vol 67 

«VELI — xic —^a 67 
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Table 1 cont'd 



<- gag cds end 

HZVHXB2 IleLysIleGlyGlyGlnLouLysGluAlaLouLeuAspThpGlyAloAspAspThrVal BB 



HIVBH102 
HXVBH5 
HIVPV22 
HXVBRU 
HXVMN 



HIVELX 



KXVBH102 
HIVBH5 
HIVPV22 
HXVBRU 



HXVKN Asn Arg 

HIVSF2 Asn ^Lys- 

HIVRF Asn Lys- 

HXVMAL XleAsn Lys- 

HIVELI Asn — Lys- 



HIVBH5 

HXVPV22 

HXVBRU 

HXVWN 

HXVSF2 

HIVRF 

HXVMAL Xle- 

HXVELX 



100 
100 
100 
100 
0 
88 



HXVSF2 Arg — 

HIVRF Vol 

HIVWAL VolArgVol 87 



87 



AcNf^V-HIVYK^l starts 
HXVHXB2 LeuGluGItyietSerLeuProGlyArgTrpLysProLysmetXleGXyGlyXleGlyGly 1 08 



120 
120 
120 
120 
17 
108 
107 
107 
107 



HXVHXB2 PheXXoLysVolArgGlnTyr AspGlnXloUeuXlGGluIleCysGlyHisLys Al aXle 1 28 

HXVBH102 — — — 

HXVBH5 * 

HIVPV22 ■ 

HXVBRU ■ 

HXVMN 

HXVSF2 ProVoX- 

HIVRF . 



140 
1li0 
140 
140 



"Thr Gl y 37 

— 128 

127 



HIVWAL _Ly^ ,27 

HIVELX _^PO~ ^ —Gin- 127 

HIVHXB2 GlyThrVolLeuVolGlyProThrProVolAsnXleXloGlyArgAsnLeuLeuThrGln 1 48 

HXVBH102 



" 160 

— 1B0 

160 

~— 160 

— 57 

148 

_ H|7 

«Bt- — 147 

— 147 
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Table 1 con*-'d 



HIYHXB2 
HIVBH102 

HIVBH5 

HIVPV22 

HIVBRU 

HIVmN Leu 

HIVSF2 

HIVRF 

HIVtnAL 

HIYELI 



\/ p6S, p51 

UeGlvCysThrLeuAsnPhePpoIleSepProIleGluThrVolProVolLysLeuLys 



HXVBH5 
HIVPV22 



HivmN 

HIVSF2 



HZVHXB2 

HIVBH102 

HZVBH5 

HIVPV22 

HIVBRU 



HXVPIN Ile- 

HIVSF2 



HIVRF 



HIVMAL Thr LysAsp- 



HIVHXB2 
HIVBH1 02 
HIVBH5 
HIVPV22 



HIVSF2 



HIVMAL 



168 
180 
180 
180 
180 
77 
168 
167 
167 
167 



J[^V«|JB2^ ProGlyWetAspGlyProLysValLysGInTrpProLeuThrGluGluLy^IleLysAlo 188 



200 
200 



HIVBRU — . 



200 

97 



HIVRF - • ' 

HIVMAL — Arg ZIZI III 

HIVELI — , ^ 



187 



LeuVQlGluneCysThrGluWetGluLysGlu GlyLysIleSerLysIleGlyProGlu 208 
~~~ — ~ — — 220 



220 
220 
220 
117 
208 
207 



Leu — 207 



HIVELI —Thr Asp 



AsnProTyrAsnThrProVolPheAloIleLysLy sLysAspSerThrLysTrpArgLys 228 

— i— 2«>0 



HIVBRU — . -_IZ]III 

HIVMN « -I_-__IZ!_~I~ ~ 2**^ 



137 



HIVRF *~ 228 



227 



HIVELI ^r.^ ■ 227 

227 
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Table 1 cont'd 



HIVHXB2 LeuValAspPheArgGluLeuAsnLysArgThrGlnAspPheTrpGluValGlnUeuGly 248 



HIVBH102 — - n , 260 

HIVBH5 Arg = 260 

HIVPV22 250 

HIVBRU 260 

HIVPW Lys 157 

HIVSF2 248 

HIVRF 247 

HIVmAL Asn ~ 247 

HIVELI 247 

HIVHKB2 XXeProHisProAlaGlyLeuLysLysLysLysSerVQlTbrVolLeuAspValGlyAsp 268 

HIVBH102 1 280 

HIVBH5 280 

HIVPV22 280 

HIVBRU 280 

HIVMN 177 

HIVSF2 268 

HIVRF 257 

HIVMAL 267 

HIVELI 267 

HIVHXB2 AlaTyrPheSerVolProLeuAspGXuAspPheArgLysTyrThrAloPheThrllePro 288 

HIVBm02 300 

HIVBH5 300 

HIVPV22 300 

HIVBRU 300 

HIVMN Lys 197 

HIVSF2 Lys ~ 288 

HIVRF LysGIu 287 

HIVMAL 287 

HIVELI —Ser 287 

HIVHX62 SerlleAsnAsnGluThrProGXylleArgTyrGlnTyrAsnVaXLeuProGlnGlyTrp 308 

HIVBH102 320 

HIVBH5 SerGXy 320 

HIVPV22 320 

HIVBRU 320 

HIVMN 217 

HIVSF2 : . — 308 

HIVRF ^Arg— — 307 

HIVMAL 307 

HIVELI 307 
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Table 1 cont'd 



HIVHXB2 LysGlySorProAlallePheGlnSerSorMetThrtysIleLouGluProPheArgLys 328 

HIVBH102 Lys 340 

KIVBH5 ~ -. 



HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 



HIVBH5 
HIVPV22 
HIVBRU 

mwm 

HIVSF2 
HIVRF 
HIVMAL Lys- 



HIVELI GluMet- 



340 
340 
340 
237 
328 



HIVMAL III 

HIVELI - . Thr 327 

HIVHXB2 GlnAsnProAspIloVallloTyrGlnTyrMotAspAspLouTypVolGlySorAspLeu 348 

HIVBH102 — - ■ , , 



360 
360 
360 
360 
257 
348 
347 
347 
347 



368 
360 
380 
380 
3O0 



HIVHXB2 eiuIloGlyGlnHisArgThrLysIleGXuGluLouArgGlnHisLouLeuArgTrpBly 
HIVBH1 02 — - ■ _ _ „ I ^ , , , ■ ■ - 

HIVBH5 , — .... — , __ . 

HIVPV22 — ■ 

HIVBRU 

HIVMN — AlO Arg _! 277 

HIVSF2 

HIVRF lie- 

HIVMAL Glu lIs 3^ 

HIVELI _-___^ L«« en ^ III 

HIVHXB2 LeuThrThrProAspLysLysHisGlnLysGluPpoPpoPheLeuTrpMetGlyTyrGlu 388 

HIVBH1 fl2 — ■ — 

400 

— 400 

— 400 

400 



HIVBH102 

HIVBH5 Phe 

HIVPV22 

HIVBRU 

HIVMN Phe - ,o- 

HIVSF2 Phe — Zli 

HIVRF Phe 

HIVmAL Phe 

HIVELI Phe Arg 



388 
387 
387 
387 
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Table 1 cont'd 



HXVHXB2 LeuHisProAspLysTrpThrValGlnProIleVQlLeuProGluLysAspSerTrpThr ABB 

HIVBH102 i>20 

HIVBH5 li e 420 

HrVPV22 420 

HIVBRU — 420 

HIVMN ■ 317 

HIVSF2 Met 408 

HIVRF ■ 407 

HIVMAL Gin Asp~Glu 407 

HIVELI Sor Lys Glu 407 

HXVHXB2 ValAsnAspXleGlnLysLeuValGlyLysLeuAsnTrpAloSerGlnXloTyrProGly 428 

HXVBH102 440 

HIVBH5 440 

HIVPY22 — . — — 440 

HXVBRU 440 

HIVMN Alo 337 

HIVSF2 —Alo 428 

HIVRF Ala 427 

HIVMAL > . 427 

HIVELI Asn GluArg 427 

HIVHXB2 XleLysVolArgGlnLeuCysLysLeuLeuArgGlyThrLysAloLeuThrGluVolXle 448 

HXVBH102 460 

HIVBH5 460 

HIVPV22 ■ ■ ■■ 460 

HIVBRU 460 

HIVMN Lys ~ 357 

HIVSF2 Lys 448 

HIVRF Lys Vol 447 

HIVMAL Lys ■ Alo AspIleVol 447 

HIVELI . 447 

HIVHXB2 ProLeuThrGluGluAloGluLeuGluLeuAloGluAsnArgGluXleLeuLysGluPro 468 

HIVBH102 480 

HIVBH5 ■■■ , , , ,„ ., . 

HIVPV22 480 

HIVBRU « . 

HIVMN 377 

H1VSF2 468 

HIVRF Gin Lys 467 

HIVMAL Alo 467 

HIVELI 4fi7 
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Table 1 cont'd 



HIVHXB2 VolHis GlyVolTyrTyrAspProSerLysAspLeuIleAloGluIleGlnLysGlnGly «i88 
" " 500 



HIVBH102 
HIVBH5 
HIVPV22 
HIVBRU 

HivmN 



HIVRF 
HIVMAL 
HIVELI 



HIVHXB2 GlnGlyGlnTrpThrTyrGlnlleTyrGlnGluProPheLysAsnLeuLysThrGlytys 



HIVBH102 
HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 



HIVRF 



HIVBH5 



HIVMN 

HIVSF2 

HIVRF 



HIVELI 



HIVBH5 



HIVBRU 



HIVSF2 VclSer —He- 

HIVRF VolAlo— 



500 
500 
500 



HIVSF2 GXu vol ~ f!I 



<»87 
487 



506 
520 
520 
520 
520 



HIVSF2 ' '^^^ 



508 



HIVMAL —GlnTyr- "^i: ^7 

HIVELI His 1 

' " ■ ' ■ ' ' - 507 

ui^!!!-o '^V''^°^'^^^''9^^YAl°"«ThrAsnAspVolLy5GlnLouThrGluAlaVolGln 528 
HIVBH 1 02 — 1 1 II __. - 



540 



HIVPV22 

HIVBRU Thr . 



540 
437 
528 



HIVMAL IleLysSer.^ Z " 



527 

-Alo 527 



ulln^% '■y^^^^'r^''^^'^g^"SerIleVolIleTrpGlyLysThr^roLysPheLysLeuProIl^ 548 



560 



HIVPV22 „ 



560 
560 



HIVMN Alo— 

—Arg , 



548 
547 



HIVMAL AloGln— . 

HIVELI Arg Ser t^" 

» ^' Arg Arg 547 
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Table 1 cont'd 



HIVHXB2 GlnLysGluThrTrpGluThrTrpTrpThrGluTyrTrpGlnAloThpTrpIleProGlu 568 
HIVBH102 _ 

580 
580 
580 



HIVBH5 — 

HIVPV22 . 

HXVBRU 

HIVMN « 

HrVSF2" « 

HIVRF Alo- 

HIVMAL Alo- 



HIVEUI Alo- 



-Thr-n-i- . 

5B8 

— 567 

567 

5g7 



HIVHXB2 TrpGluPheVolAsnThrProPpoLouValLysLeuTrpTyrGlnLouGluLvsGluPro 588 

HIVBH1 02 " — — — — ■ - I ■ — ^^^ II _ — ■ -- 



HIVBH5 

HIVPV22 

HIVBRU — 

HIVMN Vol- 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI — 



HIVHXB2 
HIVBH102 

HIVBH5 

HIVPV22 

HIVBRU . 

HIVlflN 

HIVSF2 

HIVRF lle- 

HIVMAL 



HIVHXB2 

HXVBH102 

HIVBH5 



HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



-Asp- 
-Asp- 
-Asp- 
-Asp- 




600 
600 
600 
600 
497 
588 
587 



" — —Thr 587 

' 587 

IleVolGlyAloGluThrPheTyrVa lAspGlyAlQAloAsnArgGluThrLysLeuGly 608 

" 620 

. Ser 620 

. ^ -Arg 620 

Ser 620 

■ —Lys 517 



608 
607 



HIVELI lie -- 

— — 607 



LysAlaGlyTyrVoXThrAsn ArgGlyArgGlnLysVolVolThrLeuThrAspThrThr 628 

77 Lys— — Pro— Asn— 640 

HIVPV22 , , 640 

uTuon,, _^ P"* Asn 640 



640 

-Ser- 537 

628 
627 
627 
627 
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Table 1 cont'd 



u^^L *''"''^"'-y*'^*"^^'"-*"'51nAloIloTvrLBuAlQLouGlnAspSor61vLeuGluVQl 6«,B 

HIVBHl 02 — ■ — •-■ , I _ _ „ ^ ^ _ ^ _ 

HIVBH5 , ~ 



HIVPV22 
HIVBRU 
HIVWN 



-His- 



HIVELI 



-Asn- 



HIVHXB2 
HIVBHl 02 
KIVBH5 
HIVPV22 



HIVELI 



660 
660 
660 



Hivsr2 « His ^ III 

HIVRF 

HIVMAL — — «H,-« 

MXS — — Sep 647 



647 



AsnlleValThrAspSerGlnTyrAloLeuG lyllelleGlnAaaGlnProAspglnSer 668 

" ~ ~ — Lys— 600 

~"* — ' Uys 680 

HIVBRU — " 

HIVPIN 
HIVSF2 
HIVRF 



Lys 680 

Uys 577 



-Lys 668 



HIVMAL Lys 667 

HIVELI ^ - ~ 667 

^Lys 667 

SISmL ^^"^""'^^"'■^"^°^^S"g^"Il^IleGluGlnLeuIleLysLysGluLysVa lTyrL^ B88 

HIVBH5 700 

HIVPV22 ""^ 

HIVBRU Gin 700 

HIVMN - — Ser^ : — 0 

HIVSF2 Sep. ■ 

HIVRF — ser 

HIVMAL — iio 177 ] 



-Gin Asp ~ 687 

687 



^"^"'>^°^^"^^°"^=^LysGlyIleGlyGlyAsn61uGlnValAspLysL^ 708 

HIVBH5 720 

HIVPV22 " 720 

HIVBRU ''^ 720 

HIVMN — ^ 720 

HI VSF2 6 ■< 7 

HIVRF ~« _^ ~" ~ — 708 

HIVMAL Ser — Arg-^ 707 

HIVELI — ~ ■ ■■ 707 

— — 707 
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Table 1 cont'd 



HIVHXB2 AloGlyXleArgLysValLeuPheLeuAspGIylleAspLysAloGlnAspGluHisGlu 728 



HIVBH102 : He _ 7*i0 

HXVBH5 lie -Glu 740 

HIVPV22 lie 740 

HIVBRU 740 

HIVMN --GluAsp 637 

HIVSF2 Asn —Glu — 728 

HIVRF Thi 727 

HIVMAL Ser ■ ■ ■ , ■ — qxu 727 

HIVELI Gin —Glu 727 



HIVHXB2 LysTyrHisSerAsnTrpArgAloMetAlaSerAspPheAsnLeuProProValValAla 748 

HIVBH102 — 760 

HIYBH5 760 

HIVPV22 760 

HIVBRU 760 

HIVMN —He 657 

HIVSF2 74S 

HIVRF ' 747 

HIVmAL He 747 

HIVELI Asn 747 



HIVHXB2 LysGluIleValAlaSerCysAspLysCy5GlnLeuLysGly61uAlcs?1etHisGlyGln 768 

HIVBH102 « 700 

HIVBH5 780 

HIVPV22 : 780 

HIVBRU — : 780 

HIVMN B77 

HIV5F2 — 768 

HIVRF 757 

HIVMAL 767 

HIVELI . 767 



HIVHXB2 VDlAspCysSerPpoGlylleTrpGlnLeuAspCysThrHisLeuGluGlyLysVolIle 788 

HIVBH102 800 

HIVBH5 

HIVPV22 

HIVBRU 




800 
800 
800 



HIVMN — — S97 

HIVSF2 

HIVRF 

HIVMAL . 

HIVELI 



788 
787 
787 
787 
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Table 1 cont'd 



hSJ2 '■^""°^^°"°^"^^^°^AloSerGlyTyrneGluAlo61uVoXlleProAl a61uThr 808 

HIVBH5 r~ Q20 

HIVPV22 " 320 

HIVBRU — "~ 820 

HIVMN ' 820 

HIVSF2 ' ~" 717 

HIVRF ' — 808 

HIVMAL He ^ — 807 

HIVELI " " 807 

— 807 

Sm02 ^"V^^"^^"^^^^^°^Y^P^^LeuLeuLysLeuAloGlyAr^TrpProV alLYsTh 828 

HIVBH5 '-^ ' ' 840 

HIVPV22 — , ~~_ ~' " [ 840 

HIVBRU — - — 840 

HIVMN —_—____„ " 840 

HIVSF2 ~ 

HIVRF —He- ~" ^® 

HIVMAL Vol^ 827 

HIVELI — ^VoIVal 827 

*~ — —"VolVal 827 

H1VHXB2^ Has-n.rAspAsnGlySerA3nPhe ThrBlyAloThrVolAr9AloAlaCvsTr.TroAln 848 

H1VBH5 --Ser - Lys 

HIVPV22 — . 860 

HIVBRU — -:s:^;;;;3=:t;;:=~ — 

HIVMN Pro ««,.Th^ r ~ 

HIVRF -^J^]*^^ '-V''- 848 

Sor Ala Lys g^^^ 

wJSmM ^"y^^"'-y''""g^'"'>"^^yI^^P'-oTyrAsnPro61nSerGInGlyVol VoiGlaSer 868 

H1VBH5 ^ 880 

HIVPV22 ~ " 880 

HIVBRU — — " — ■ — 880 

HIVMN ^- 880 

HIVSF2 He — 777 

HIVRF _« 868 

HIVMAL Asn ' — 8B7 

HIVELI — ■ 867 

■ ■* 867 
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Table 1 cont'd 



HIVHXB2 MetAsnLysGluLeuLvsLysIlelleGlyGlnValArgAspGlnAloGluHisLeuLys 888 

HIVBH102 900 

HIVBH5 900 

HIVPV22 900 

HIVBRU ^- 900 

HIVMN 797 

HIVSF2 Asn 888 

HIVRF Gin Gin 887 

HIVIVIAL Glu 887 

HIVELI 887 



HXVHXB2 ThrAlaValGlnMetAlaValPheZleHisAsnPhoLysArgLysGlyGlylleGlyGly 908 



HIVBH102 920 

HIVBH5 : 920 

HIVPV22 920 

HIVBRU - 920 

HIVMN Arg 817 

HIVSF2 908 

HIVRF 907 

HIVMAL . 907 

HIVELI ArgArg 907 



HIVHXB2 TyrSerAlaGlyGXuArglleValAspIlelleAlQThrAspIleGlnThrLysGluLeu 928 

HIVBH102 ' 940 

HIVBK5 940 

HIVPV22 940 

HIVBRU ~ 940 

HIVMN 61 y— 837 

HIVSF2 ■ — ^ 928 

HIVRF 927 

HIVMAL He— Met 927 

HIVELI He 927 



HIVHXB2 61nLysGlnIleThrLysIleGlnAsnPheArgValTyrTyrArgAspSerArgAsnSer 948 

HIVBH102 — Pro 960 

HIVBH5 — Pro 960 

HIVPV22 -Pro 9G0 

HIVBRU . — AspPro 960 

HIVMN AspPro 857 

HIVSF2 ' ' — AsnLysAspPro 948 

HIVRF AspPro 947 

HIVMAL — — /.sn AspPro 947 

HIVELI He ■ AspPro 947 
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TJible 1 cont'd 



ull^u^L '-^""^'■P^y®*^^y''''^'^^°Ly^LeiiLeuTrpLysGlyGluGlyAlaVoXVolIleGlnAsp 
HIVBH102 — .1 , 

HIVBH5 . " 

HIVPV22 — Z Z.. 

HIVBRU — — - 

HIVMN . 

HIVSF2 — 

HIVRF — His 

HIVMAL lie— . ' 

HIYELI IlG , " 

967 



968 
960 
980 
980 
980 
877 
968 



HIVHXB2 AsnSerAs^^ 



HIVBH102 
HIVBH5 



HIVSF2 

HIVRF 

HIVMAL 



HIVHXB2 * 
HXVBH102 

HZVBH5 

HIVPV22 

HIVBRU 

HIVIVW Thr 

HIVSF2 



61nMetAloGlyAspAspCysVolAloSerArgGlnAsp61uAsp+++ 1 004 

" — — 1016 

1016 

' ^ ~ 1016 

--JIZIIIIII~~~ — 1016 

" — — 913 

" ""^^^ 

I!^.^!: — GlyGly JJ^ 



HIVRF 
HIVMAl 
HIVELI 



1000 



HIVPV22 « — ~Z '^^^^ 

HIVBRU — . . " 

HIVMN Asn ~— 



1000 

Vol _ 



988 
987 



HIVELI Lys ^ol ..SUZ 9^ 



1003 
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TABLE 2 

HIV-l pol gene HIVHXB2 Sequence 

Data from Human Retroviruses and AIDS, 1988 

Los Alamos National Laboratory 

AcNPV-HIVWiDol Virus 

~ " BBS 

Bam HI tl 
5'-GGATCCTATAAATATC tttttta gggaogotct 
pol cds start (NH2-terndnus uncertoin) -> 



2101 


ggccttccta coogggoogg ccogggaott 


ttctteogog 


cagoccagag 


ccoQcogccc 


2161 


cQccogoaga gogcttcogg tctggggtog 


agocoQcoac 


tecccctcag 


oogcoggogc 


2221 


cgotogocaa ggaaetgtot cctttaoctt 


ccctcaggtc 


actctttggc 


aacgocccc-t 


2281 


cgtcQcoaTA Aogotogggg ggcaoctaoa 
<- gog cds end 


ggaogctcta 


ttogatocag 


gagcagatga 


2341 


tacagtotta% googoQatga 


gtttgccogg 


aagatggaao 


ccaaaoQtga 


togggggaat 


2401 


tggaggtttt atcaaogtaa 


gacagtotga 


tcogotacte 


Q'togoocrtct 


gtggacotoo 


1 


ogctataggt ocagtottog 


taggacctac 


occtgtcoac 


atoottggaa 


goaa-tc'tgt't 


2521 


gactcagott ggttgcoctt: 


taaattttec 


cottogccct 


attgogoctg 


toccogtooa 




ottQoogcca ggocrtggatg 


gcccoaoagt 


taaocaatgg 


ccottgacag 


aagaaoaaat 




Doaagcatte gtogooattt 


gtocogagat 


ggaoooggaa 


gggaaaofbt 


CQQooattgg 


^7 tit 


gcctgaaoa't ccotacoata 


ctccagtott 


tgccoteaog 


aaoaoogaca 


gtactaaatg 


2761 


gogooaatta gtogotttca 


gagoacttaa 


taagagaact 


CQogacttct 


gggaogttca 


2821 


ottoggooto ccaeotcccg 


cogggttaaa 


aoagaaaaaa 


tcagtoocog 


tactggatgt 


2881 


gggtgotgco tottttteog 


ttcccttoga 


tgaogocttc 


oggaogtoto 


ctgcatttac 


2941 


cotocctogt otooacoDtg 


ogacaccogg 


gottogotat 


cagtocaotg 


tgcttccoca 


3001 


gggatggaoci ggatcoccog 


COOtOttCCQ 


oogtogcotg 


ocoaoaatct 


tagagccttt 


3061 


togaaoocoa oatecagaco 


togttatcta 


tcoatacotg 


gotgottHgl: 


QHgtaggatc 


3121 


tgocttogoa crtagggcogc 


otagaocooQ 


□atogoggog 


ctgagocaoc 


atctgttgog 


3181 


gtggggactt accQcaccog 


Qcoaaaoaca 


tcogaoagaa 


cctccottcc 


tttggatggg 


3241 


ttatgaactc cotectgota 


ootggacogt 


acogcctato 


gtgctgccag 


ooaaogacog 


3301 


ctggoctgtc aatgacotac 


ogaogttogt 


ggggaaattg 


oottgggcDO 


gtcogottta 


3361 


cccogggott ooogtoaggc 


oottotgtaa 


octccttaga 


ggooccoQog 


coctoocoga 


3421 


ogtoatacca ctoocogoog 


Qogcogogct 


ogaactggco 


gaoaocogag 


ogottctoao 
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Table 2 cont'd 



3481 ogaoccogta cotggogtigt ottotgoccc atcoaoogac ttootogcog aaatacogoa 

3541 gcoggggcao ggccoatggo catatcooot ttatcoogag ccatttaooo atctgooaac 

3S01 oggaaaotat gcoogaatga ggggtgccca cactootgot gtoooocoot toacogoggc 

36G1 ogtgcooooa otooccocog aaogcatogt oototgggga aogoctccto oatttooact 

3721 gcccatocQQ aoggaoacat gggoaocotg gtggocogog tattggcaog ccocctggat 

3781 tcctgogtgg gogtttgtta otacccctcc cttogtgoaa ttatggtocc agttogogoa 

3841 ogoacecato gtaggogcag oaoccttcto tgtogatggg gcagctDQca gggogactoa 

3901 attoggaoaa gcoggototg ttoetoatog oggoogacoo ooogttgtco ccctooctgo 

3961 coeaaeaaat cogoagaetg ogttoeaogc ootttotcta gctttgeogg ottcgggott 

4021 agoogtoaae atagtoaeog acteaeaota tgeottogga otcotteoog eocoaeeaga 

*081 teoaagtgoo teogagtteg teaatcaoot aotogogeog ttootaoaaa oggoaoaggt 

*141 etatetggca tgggtoccog eacaeoaagg oottggogga ootgoacoog togatooott 

4201 ogtcagtget ggaotcaggo oagtDctatt tttogotggo otogotaogg eeeaagatgo 

4261 aeotgagaaa toteocagto ottggogoge oatggctogt gottttaocc tgccocctgt 

4321 ogtogeoaaa gooatogtag ecogetgtgo toootgteog etooaaggog aogccptgco 

4381 tggoeaogta gaetgtogte coggoototg gcaoetogot tgtococott togaaggooa 

4441 ogttotcctg gtogcogtte otgtogeeog tggotatota gaogcQgaag ttottccogc 

4501 ogoooeoggg eoggoooeog eotottttct tttooootta geoggaogot ggceogtooo 

4561 oaeooteeat aetgaeaatg geogeoattt eoecggtget ocggttoggg ccgcctgttg 

4621 gtgggegggo ateaagcogg aatttggoot tceetaeoot eeceaoogte aoggogtogt 

4681 ogoatetotg OQtaoogaot taaogoaoat totoggacog gtoogogotc aggctgoaco 

4741 tettaagoco geagtocooa tggcogtott cotccacoat tttaaaogoo aoggggggot 

4801 tggggggtoe agtgeogggg ooogoatogt ogaeataota geoocagaea tocoooctoo 

4861 ogoottacaa aoocoaatta CQOQoottea ooottttcgg gtttattocA Gggocogcog 

/\ 3'sj. 

4921 ooottcoett tggaaoggoe eogeoooget cctctggooo gGTgoagggg cogtagtaot 

5'sj A 

4981 oeoogotoot cgtgocotoo oogtogtgcc oogaogoooo gcooogotco ttogggottA 

sor 23 kO cds start -> 

5041 TCgaooocog o^ggcaggtg otgottgtgt ggcoogtogg coggotooga otTA^ ,.. 

Crossover <- poi end 

linker Mquence 
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TABLE 3 

HIV-l pol gene 
HIVHXB2 Sequence 

Data from Himan Retroviruses and AIDS, 1988 
Los Alamos National Laboratory 

AcNPV*HTVW»Pol Vir us 

RBS 

Bam HI tl (AcNPV-HIVWHpqI start) 

S'-GGATCCTATAAATATGkttttto gggoagatct 
pol cds start (NH2-tenninu5 uncertain) -> 

2101 ggccttccto caogggaogg ccogggaatt ttcttcogog cogaccagag ccaocagccc 

21 G1 coccagaoga gogcttcagg tctggggtog agocaacaoc tccccctcog oogcoggogc 

2221 cgatogocoa ggaoctgtat cctttooctt ccctcaggtc octctttggc oacgocccct 

2281 cgtcacooTA Aagotogggg ggeooctooo ggaagctcta ttagotacog gagcagotga 

AcNPV'HIVYKpol Bam HI RBS ti 

Virus 5 ' -GGATCCTATAAA TATG Cti; AcOTV-HIVYKpol start) 

2341 tacagtattoi gaagoootgo gtttgccogg oagotggaao ccaaaaotgo togggggaat 
2401 tggoggtttt otcaoogtao gacogtotga tcogotoctc atogooatct gtggacotoo 
2461 agctotaggt ocogtattog toggocctoc occtgtcoac otoottggaa goootctgtt 
2521 goctcagott ggttgcoctt tooottttcc cottagccct ottgagactg taccogtooo 
25B1 ottooagcco ggoatggotg geecaoaogt tooocootgg ccattgocag oogoaoadat 
2641 oooagcotta gtogooottt gtoeogagot ggooooggoo gggaaoattt coooaottgg 
2701 gcctgooaot ecotocaato ctccogtatt tgccotooag aaoaoogoco gtactoaatg 
2761 gogooaatta gtogatttca gagaocttoa taogagaact caagacttct gggoagttco 
2821 ottaggaato ccacoteccg cogggttoao aaagaaaooa tcogtoocog toctggotgt 
2881 gggtgotgca tatttttcag ttcccttaga tgoagocttc oggoagtoto ctgcotttoc 
2941 cotocctagt atooocoatg ogoeoccogg gottogotat cogtocaatg tgcttccoco 
3001 gggatggooQ ggotcoccag coatottcca oogtogcotg ocooooatct tagogccttt 
3061 togooaocao oatccogaca tagttotcto tcoatoeatg gatgotttgt atgtoggotc 
3121 tgacttagao otogggeogc otogoocooa ootogaggog ctgogocooc otctgttgog 
3181 gtggggoctt aceocoeeog ocooooaDca tcogooogaa cetccottcc tttggotggg 
3241 ttotgooctc cotcctgoto ootggocogt ocogcctato gtgctgccag ooaoogocog 
3301 ctggactgte ootgacotac ogaogttogt ggggooattg oottgggcoo gtcogottto 
3361 cccogggott ooogtooggc oattotgtoo actccttogo ggaoccaaog cactoocogo 
3421 ogtootocco ctaocagaog oogcagogct agooctggca gooaocogog agottetooo 
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Table 3 cont'd 



3*8T oBooccogta cotggagtgt ottotgoccc otcoaoogoc ttootogcog ooatocogoo 
35*1 gcoggggcoa ggccootgga cototcoaot ttotcoogog ccotttooco otctgooooc 
36a, oggocaoft gcoogootgo ggggtgcccc eoctootgot gtoooacoot toocogoggc 
366, ogtgcooooo otoaccocog ooogcotogt ootatggggp oogoctccta cotttoooct 
372, gcccotocoo oaggooocot gggooocotg gtggacogog tottggcaog ccocctggot 
378, tcctgogtgg gogtttgtto otocccctcc cttogtgaoo ttotggt.cc cgttogogao 
384, ogaocccoto gtcggogcog cooccttcto tgtogotggg gcagctooco gggogoei:aa 
390, cttoggacoo gcoggototg ttoctootog oggoogacoa ooogtt^co ccctooctgo 
396, coccocooot cogoogoctg ogttocoogc ootttotcto gctttgcogg ottcgg^ptt 
402, ogoogtoooc otogtoocog octcocoota tgcottoggo otcottcoog cacooccoga 
«8, tcoaagtgoa tcogogttcg tcootcpoot ootogogcog ttootooooa oggooooggl. 

ctotctggco tgggtcccog cocpcooogg oottggoggp ootgoocoog togotooatt 
4201 ogtcogtgct ggootcoggo ocgtcctott tttagotggo ctogotoogg cccoogotgo 
426, ocotgogooo totcocogto ottggogogc ootggctogt gottttoocc tgccocctgt 
432, ogtogcocoo goootcgtog ccogctgtgo tooatgtcog ctooooggog oogccotgco 
438, tggocoogto goctgtogtc coggoototg gcooctagot tgtccocott tcgooggoao 
444, ogttctcctg gtogcogttc otgtogceog tggototota googcogoog ttattccogc 
450, ogooocoggg coggooocog catottttct tttooootto gcoggoogot ggccogtcoo 
456, oococtacot octgocoatg gcogcoottt coccggtgct ocggttoggg ccgcctgttg 
462, gtgggcggga otcoogcogg ootttggoot tccctocoot ccccooogtc ooggogtogt 
468, ogootctotg ootoocgoot tooogoooct totoggoccg gtoogogatc oggctgooca 
474, tcttocgoco gcogtocooo tggcogtott cotccocaot tttcaoogoo ocgggggg,^ 
4601 tggggggtoc ogtgcogggg ooogootogt ogocotoota gcoocogoco tocoooctoo 
486, ogoottocoo ooocoootto cooooottee ooottttcgg stttottocA-Oggocogtag 

/\ 3<sj 

492, ooottcoctt tggoooggoc cogcooogct cctctggooa gGTgoogggg cogtogtoot 

5'sj A 
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Table 3 cont'd 



4981 acaogotaat agtgacotaa oogtagtgcc oogoogoooa gcooagatca ttogggcrttA 

Bam HI 

5041 TGgoooacag atggcoggtg atgattgtgt ggcoogtago coggotgogg otTAGSATCC 

•Sph I 
GCATG-3 • 
<- pol end 

5101 ggoaoagtt't ogtaaaacac co'tatgtat.g tttcagggoa ogctogggga 'tggtt't'to'ta 

5161 gocateocta 'tgaaogccct cotecaagaa toagttcoga ogtococotc ccoctogggg 

5221 atgctagat't ggtaatoaca acotat.tggg gtctgcatoc aggagoaoga gartggco'tt: 

5281 tgggteoggg ogtctccoto gaatggagga aaaogogoto togcococoa gtogoccctg 

5341 aactogcaga ccooetoatt cotctgtcrtt octttgactg tttttcAGac tctgctirtaa 

A 3*sj 

5401 gaaaggcctt attaggacoc atogttagcc ctaggtgtgo atotcaogco ggacatooco 

5461 ogGTaggotc tetocoatoc ttggcoctag cagcattoot aacoccoaaa oagatooogc 
5»si A 

5521 cocctttgcc togtgttacg ooactgocog oggatagATG goacaogccc cogoogocca 

R orf cds start -> 

5581 agggccacag agggogccac ocaatgoatg gacacTAGag cttttogogg ogcttaogaa 

<- son 23 kD cds end 

5641 tgaogctgtt agocottttc ctoggatttg gctccatggc ttagggcaoc atotctotgo 

5701 oaettotggg gatacttggg caggogtgga agccotaoto ogoattctgc oacooctgct 

5761 gtttatccot tttcAGaatt gggtgtcgoc oTAGcagoat aggcgttoct cgocogogga 
/\ 3'sj <- R orf cds end 

5821 gogeoogoaA TGgagccagt ogotcctaga ctogogccct ggoogcotcc oggaogtcog 
tot cds start -> 

5881 cctoaaactg cttgtaccoa ttgctattgt ooaoogtgtt gctttcattg ccaagtttgt 

5941 ttcataocaa aogccttogg cotctcctAT GgcAGgaago ogcggogoca gcgacgoago 
trs/ort cds start -> /\ 3'sj 

6001 gctcotcogo ocagtcogoc tcotcoogct tctctotcaa ogcoGTaagt ogtocatgta 

(tat, trs/art. 27 kD) S^sj /\ 

6061 AcGcoaccto taccaotagt ogeaatagta gcattogtog tcgcootoot ootogcoata 
U orf -> 

6121 gttgtgtggt ccatagtoot catagoatot aggaoootot toogocaoog oaoaatagoc 
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CLAIMS: 

1. The use of a polypeptide as a reagent in a 
diagnostic test for HIV infection or as a vaccine against 
HIV infection characterized in that said polypeptide 

5 comprises a substantial portion of each of more than one of 
the enzymes coded for by the HIV-pol gene. 

2. The use claimed in Claim l characterized in that 
said polypeptide comprises a plurality of enzymes selected 
from HIV-pol protease, HIV-pol reverse transcriptase, 

10 HIV-pol RNAse H and HIV-pol Integrase. 

3. The use claimed in Claim 2 characterized in that 
said polypeptide comprises substantial portions of all four 
of said enzymes. 

4. The use claimed in Claim 2 characterized in that 

15 said polypeptide omits at least that part of the amino acid 
sequence of the HIV-pol protease gene which codes for the 
active site responsible for proteolytic activity. 

5. The use claimed in Claim 3 characterized in that said 
polypeptide omits at least that part of the amino acid 

20 sequence of the HIV-pol protease gene which codes for the 
active site responsible for proteolytic activity. 

6. A diagnostic kit for detecting antibodies to HIV 
antigens characterized in that said kit contains as a test 
reagent, a polypeptide as defined in Claim 1, claim 2, 

25 Claim 3, Claim 4 or Claim 5. 

7. A vaccine for protecting an individual against HIV 
infection comprising a polypeptide and a pharmaceutically 
acceptable carrier, characterized in that said polypeptide 
xs as claimed in Claim 1, claim 2, Claim 3, Claim 4 or 

30 Claim 5. 

8. A polypeptide comprising a substantial portion of 
each of more than one of the enzymes coded for by the 
HIV-pol gene characterised by omitting at least that part of 
the amino acid sequence of the HIV-pol protease gene which 

35 codes for the active site responsible for proteolytic 
activity. 
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9. A polypeptide as claimed in Claim 8 characterized by 

comprising sequences of a plurality of enzymes selected from 
HIV-pol protease, HIV-pol reverse transcriptase, HIV-pol 
RNase H emd HIV-pol Integrase. 
5 10. A polypeptide according to Claim 9 characterized 

in that said polypeptide contains substantial portions of 
all foxir of said enzymes. 

11. A polypeptide according to Claim 8 characterized in 
that said polypeptide has an amino acid sequence 
10 substantially as shown in Table 3 beginning with the amino 
acid Met marked "AcNPV-HXVYKpol starts". 
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